
OIPE 


3 

5 

7 

8 

10 

12 

13 

14 

15 

16 

17 

18 
20 
21 
22 

23 

24 

25 
27 

C--> 28 
C--> 29 

32 

33 

34 

38 

39 

40 

41 

43 

44 

45 

46 
49 

51 

52 

53 

54 

55 

W--> 57 
58 
60 
62 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: NANCY CARRASCO, ET AL. 

(ii) TITLE OF INVENTION: THYROID SODIUM/IODIDE SYMPORTER AND 

NUCLEIC ACID ENCODING SAME 

(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: AMSTER, ROTHSTEIN & EBENSTEIN 

(B) STREET: 90 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10016 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH 1.44 Mb STORAGE 

DISKETTE 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: ASCII 
(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/995,007 

(B) FILING DATE: 26-NOV-2001 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US/08/595,553A 

(B) FILING DATE: FEBRUARY 1, 1996 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: CRAIG J. ARNOLD 

(B) REGISTRATION NUMBER: 34,287 

(C) REFERENCE/DOCKET NUMBER: 96700/393 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 697-5995 

(B) TELEFAX: (212) 286-0854 or 286-0082 

(C) TELEX: TWX 710-581-4766 
(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2839 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 

(D) TOPOLOGY: LINEAR f~ 

(ii) MOLECULE TYPE: 1 

(A) DESCRIPTION: OLIGONUCLEOTIDE 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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64 


(vi) 

ORIGINAL SOURCE: 









65 



(A) ORGANISM: 

RAT 









-> 66 



(C) INDIVIDUAL ISOLATE 

: SODIUM/IODIDE 

SYMPORTER 


68 


(xi) 

SEQUENCE DESCRIPTION: SEQ ID NO 

i: 1: 





70 

GAATTCCGGG TCGACCACGC GTCCGGCGGT 

GACTCGCGCT 

GCGACTCTCC 

50 

72 

CACTGACCGA GAGTCCCCGA CGTCCTCCGC 

ATCCTCTCCT 

CACCGAGTCA 

100 

74 

CCTGTCTCC ATG GAG GGT GCG GAG GCC GGG GCC CGG GCC ACC 

142 

75 



Met Glu Gly Ala Glu Ala Gly Ala Arg Ala Thr 


76 




1 



5 





10 



78 

TTC 

GGC 

GCC 

TGG GAC 

TAC 

GGC 

GTG 

TTC 

GCG 

ACC 

ATG 

CTG 

CTG 

184 

80 

Phe 

Gly 

Ala 

Trp Asp 

Tyr 

Gly 

Val 

Phe 

Ala 

Thr 

Met 

Leu 

Leu 


81 




15 




20 





25 


83 

GTG 

TCC 

ACG 

GGC ATC 

GGG 

CTA 

TGG 

GTC 

GGC 

CTG 

GCC 

CGC 

GGT 

226 

85 

Val 

Ser 

Thr 

Gly He 

Gly 

Leu 

Trp 

Val 

Gly 

Leu 

Ala 

Arg 

Gly 


86 




30 





35 






88 

GGC 

CAA 

CGC 

AGT GCC 

GAC 

GAC 

TTC 

TTT 

ACC 

GGG 

GGC 

CGG 

CAG 

268 

89 

Gly 

Gin 

Arg 

Ser Ala 

Asp 

Asp 

Phe 

Phe 

Thr 

Gly 

Gly 

Arg 

Gin 


90 

40 




45 





50 





92 

TTG 

GCA 

GCC 

GTT CCT 

GTG 

GGG 

CTG 

TCG 

CTG 

GCC 

GCC 

AGT 

TTC 

310 

93 

Leu 

Ala 

Ala 

Val Pro 

Val 

Gly 

Leu 

Ser 

Leu 

Ala 

Ala 

Ser 

Phe 


94 


55 




60 





65 




96 

ATG 

TCG 

GCT 

GTG CAG 

GTG 

CTC 

GGG 

GTC 

CCC 

GCC 

GAG 

GCA 

GCG 

352 

97 

Met 

Ser 

Ala 

Val Gin 

Val 

Leu 

Gly 

Val 

Pro 

Ala 

Glu 

Ala 

Ala 


98 



70 




75 





80 



100 

CGC 

TAC 

GGG 

CTC AAG 

TTC 

CTG 

TGG 

ATG 

TGC 

GCG 

GGT 

CAG 

TTG 

394 

101 

Arg 

Tyr 

Gly 

Leu Lys 

Phe 

Leu 

Trp 

Met 

Cys 

Ala 

Gly 

Gin 

Leu 


102 




85 




90 





95 


104 

CTC 

AAC 

TCG 

CTG CTC 

ACA 

GCG 

TTT 

CTC 

TTC 

TTG 

CCG 

ATC 

TTC 

436 

105 

Leu 

Asn 

Ser 

Leu Leu 

Thr 

Ala 

Phe 

Leu 

Phe 

Leu 

Pro 

lie 

Phe 


106 




100 





105 






108 

TAC 

CGC 

CTG 

GGC CTT 

ACC 

AGC 

ACC 

TAC 

CAG 

TAC 

CTA 

GAG 

CTG 

478 

109 

Tyr 

Arg 

Leu 

Gly Leu 

Thr 

Ser 

Thr 

Tyr 

Gin 

Tyr 

Leu 

Glu 

Leu 


110 

110 




115 





120 





112 

CGC 

TTC 

AGC 

CGA GCG 

GTC 

CGG 

CTC 

TGC 

GGG 

ACG 

CTG 

CAG 

TAC 

520 

113 

Arg 

Phe 

Ser 

Arg Ala 

Val 

Arg 

Leu 

Cys 

Gly 

Thr 

Leu 

Gin 

Tyr 


114 


125 




130 





135 




116 

TTG 

GTG 

GCC 

ACG ATG 

CTG 

TAT 

ACA 

GGC 

ATC 

GTG 

ATC 

TAC 

GCG 

562 

117 

Leu 

Val 

Ala 

Thr Met 

Leu 

Tyr 

Thr 

Gly 

lie 

Val 

lie 

Tyr 

Ala 


118 



140 




145 





150 



120 

CCT 

GCG 

CTC 

ATC CTG 

AAC 

CAA 

GTG 

ACC 

GGG 

TTG 

GAC 

ATC 

TGG 

604 

121 

Pro 

Ala 

Leu 

lie Leu 

Asn 

Gin 

Val 

Thr Gly 

Leu 

Asp 

lie 

Trp 


122 




155 




160 





165 


124 

GCA 

TCG 

CTC 

CTG TCC 

ACA 

GGA 

ATC 

ATC 

TGC 

ACC 

TTG 

TAC 

ACT 

646 

125 

Ala 

Ser 

Leu 

Leu Ser 

Thr 

Gly 

lie 

lie 

Cys 

Thr 

Leu 

Tyr 

Thr 


126 




170 





175 






128 

ACC 

GTG 

GGT 

GGT ATG 

AAG 

GCC 

GTG 

GTC 

TGG 

ACA 

GAT 

GTG 

TTC 

688 

129 

Thr 

Val 

Gly 

Gly Met 

Lys 

Ala 

Val 

Val 

Trp 

Thr 

Asp 

Val 

Phe 


130 

180 




185 





190 





132 

CAG 

GTT 

GTG 

GTA ATG 

CTC 

GTT 

GGC 

TTC 

TGG 

GTG 

ATC 

CTG 

GCC 

730 
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133 Gin Val 

134 195 

136 CGA GGC 

137 Arg Gly 

138 

140 CTC GCT 

141 Leu Ala 

142 

144 CCT' GAT 

145 Pro Asp 

146 

148 GGT GGC 

149 Gly Gly 

150 250 

152 GCC CAG 

153 Ala Gin 

154 265 

156 GCC AAA 

157 Ala Lys 

158 

160 ATT GTG 

161 He Val 

162 

164 TAC TAC 

165 Tyr Tyr 

166 

168 GCC CCC 

169 Ala Pro 

170 320 

172 GAG GAT 

173 Glu Asp 

174 335 

176 TAC AGT 

177 Tyr Ser 

178 

180 ATG GCA 

181 Met Ala 

182 

184 CCT GGC 

185 Pro Gly 

186 

188 CTC TCA 

189 Leu Ser 

190 390 

192 CTG TCC 

193 Leu Ser 

194 405 

196 ACT GTG 

197 Thr Val 


Val Val Met 

GTC ATT CTC 
Val lie Leu 
210 

CAG AAC CAT 
Gin Asn His 
225 

CCT CGG AGC 
Pro Arg Ser 
240 

ACA CTG GTG 
Thr Leu Val 

GTA CAG CGC 
Val Gin Arg 

CTG GCC CTG 
Leu Ala Leu 
280 

GCC AGT GCA 
Ala Ser Ala 
295 

AAG GAC TGT 
Lys Asp Cys 
310 

GAC CAG TAC 
Asp Gin Tyr 

CTG CCC GGG 
Leu Pro Gly 

GGC ACC CTC 
Gly Thr Leu 
350 

GCT GTG ACT 
Ala Val Thr 
365 

CTG GCA CCT 
Leu Ala Pro 
380 

TTC ATC TAC 
Phe lie Tyr 

TCA CTG CTG 
Ser Leu Leu 

ATG GGT GTC 
Met Gly Val 


Leu Val Gly 
200 

CTG GGG GGT 
Leu Gly Gly 
215 

TCC CGG ATC 
Ser Arg lie 

CGG TAC ACC 
Arg Tyr Thr 

TGG CTC TCC 
Trp Leu Ser 
255 

TAT GTG GCC 
Tyr Val Ala 
270 

CTT GTC AAC 
Leu Val Asn 
285 

GCT TGC TGT 
Ala Cys Cys 

GAC CCC CTC 
Asp Pro Leu 

ATG CCG CTG 
Met Pro Leu 
325 

GTC CCC GGG 
Val Pro Gly 
340 

AGC ACT GCA 
Ser Thr Ala 
355 

GTG GAA GAC 
Val Glu Asp 

CGG AAG TTG 
Arg Lys Leu 

GGC TCT GCC 
Gly Ser Ala 
395 

GGA GGT GGT 
Gly Gly Gly 
410 

ATC AGT GGG 
lie Ser Gly 


Phe Trp Val 

CCC CGG AAC 
Pro Arg Asn 

AAC CTG ATG 
Asn Leu Met 
230 

TTC TGG ACT 
Phe Trp Thr 
245 

ATG TAC GGT 
Met Tyr Gly 
260 

TGC CAC ACA 
Cys His Thr 

CAG CTG GGC 
Gin Leu Gly 

GGC ATT GTC 
Gly lie Val 
300 

CTC ACA GGC 
Leu Thr Gly 
315 

CTT GTG TTG 
Leu Val Leu 
330 

CTC TTC CTG 
Leu Phe Leu 

TCC ACC AGC 
Ser Thr Ser 

CTC ATC AAG 
Leu lie Lys 
370 

GTT TTC ATC 
Val Phe lie 
385 

TGC CTC ACT 
Cys Leu Thr 
400 

GTC CTC CAG 
Val Leu Gin 

CCT CTA CTA 
Pro Leu Leu 


lie Leu Ala 
205 

GTG CTC AGC 
Val Leu Ser 
220 

GAC TTT GAC 
Asp Phe Asp 
235 

TTC ATA GTG 
Phe lie Val 

GTG AAC CAA 
Val Asn Gin 

GAG GGA AAG 
Glu Gly Lys 
275 

CTC TTC CTG 
Leu Phe Leu 
290 

ATG TTC GTC 
Met Phe Val 
305 

CGT ATC TCA 
Arg lie Ser 

GAC ATT TTT 
Asp lie Phe 

GCC TGT GCC 
Ala Cys Ala 
345 

ATC AAC GCC 
lie Asn Ala 
360 

CCG AGG ATG 
Pro Arg Met 
375 

TCT AAA GGG 
Ser Lys Gly 

GTG GCT GCT 
Val Ala Ala 

GGT TCC TTC 
Gly Ser Phe 
415 

GGC GCC TTC 
Gly Ala Phe 


772 

814 

856 

898 

940 

982 

1024 

1066 

1108 

1150 

1192 

1234 

1276 

1318 

1360 

1402 
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198 

200 

201 

202 

204 

205 

206 
208 

209 

210 
212 

213 

214 
216 

217 

218 
220 
221 
222 

224 

225 

226 
228 

229 

230 

232 

233 

234 

236 

237 

238 

240 

241 

242 
244' 

245 

246 

248 

249 

250 

252 

253 

254 
256 
258 
260 
262 
264 
266 
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420 425 430 

ACG CTT GGG ATG CTG CTC CCA GCC TGC AAC ACG CCA GGC GTT 

Thr Leu Gly Met Leu Leu Pro Ala Cys Asn Thr Pro Gly Val 

435 440 445 

CTC TCC GGG TTG GCA GCA GGC TTG GCT GTA TCC CTG TGG GTG 

Leu Ser Gly Leu Ala Ala Gly Leu Ala Val Ser Leu Trp Val 

450 455 

GCC GTA GGG GCC ACA CTG TAT CCC CCT GGA GAG CAG ACC ATG 

Ala Val Gly Ala Thr Leu Tyr Pro Pro Gly Glu Gin Thr Met 

460 465 


DATE: 05/21/2002 
TIME: 14:23:16 


470 


475 


480 


490 


495 


505 510 

GAT ACC TTT TAC GCC ATC TCC TAT CTC 
Asp Thr Phe Tyr Ala lie Ser Tyr Leu 
520 


530 

CTT ACT 
Leu Thr 
545 
CTG TGG 
Leu Trp 


535 


550 


560 565 

AAG GAA GAC ACT GCC ACC CTG GAG GAG 

Lys Glu Asp Thr Ala Thr Leu Glu Glu 

575 580 

CCG GAA GAC ATC CCT GCT GTG ACC AAG 

Pro Glu Asp lie Pro Ala Val Thr Lys 

590 

AAG CCA GGC GCC GAG ACC CAC CCC CTG 

Lys Pro Gly Ala Glu Thr His Pro Leu 

600 605 610 

GTG GAG ACC AAC CTC TGAGGGCGGG GTCCAAGAAG GCCAATCACA 
Val Glu Thr Asn Leu 
615 


GGCCTCGGGC CAGCAGCCTC CTCTCTGGAT GGTTGGACCT GAGCATATAT 
AGAAGCTTGG CTGATACATG CCCTGCCCAG AAGTCCCTGT GTCTTACCCG 
CACCAAAGAG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG 
GAGTTGGTTC TCCATCCACA AAGGAAACCG TCTGGAACCT TCATGCCCTT 
GTAGATTTCA GTAGGCAGCG GAGAACACTC AGCTTCTCCA GACTGAGGTT 
TTCTCATTTA TCAGGCAGAG AAACGGAGGG CTGTCACCCC AACACCGGGG 


1444 


1486 


1528 


: TGC 

ACC 

AAC 

GAT 

TCG 

1570 

r Cys 

Thr 

Asn 

485 

Asp 

Ser 


: AAC 

GCT 

TCC 

AAC 

GGG 

1612 

■ Asn 

Ala 

Ser 

Asn 

500 

Gly 


1 CGC 

CCT 

GCC 

CTC 

GCT 

1654 

Arg 

Pro 

Ala 

Leu 

Ala 

515 


TAT 

TAC 

GGG 

GCT 

CTG 

1696 

Tyr 

525 

Tyr 

Gly 

Ala 

Leu 


GCT 

CTC 

ATC 

AGC 

TAC 

1738 

Ala 

Leu 

540 

He 

Ser 

Tyr 


CTG 

GGT 

CCC 

GGA 

TTG 

1780 

Leu 

Gly 

Pro 

555 

Gly 

Leu 


GCG 

TCT 

GTG 

GCC 

CCA 

1822 

Ala 

Ser 

Val 

Ala 

570 

Pro 


AGC 

CTG 

GTG 

AAG 

GGA 

1864 

Ser 

Leu 

Val 

Lys 

Gly 

585 


AAG 

CCC 

CCT 

GGC 

CTC 

1906 

Lys 

595 

Pro 

Pro 

Gly 

Leu 


TAT 

CTG 

GGG 

CAC 

GAT 

1948 

Tyr 

Leu 

Gly 

His 

Asp 



1993 


2043 

2093 

2143 

2193 

2243 

2293 
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268 AGGAGACAGT AGAAGGGTCA TAGATACAAA GAAAACTAAG GCAGAGGGAG 2343 
270 AAATGAATTG TCTACAGAGC ACAGAGCTCC AAGGATTGTG AAGCTACCTT 2393 
272 GAGGTGCCAA GGGACGGATT CTCAGAGCCT TCACAAGACA CAAACGGACG 2443 
274 AGTTGCCTCC TCCAATTCAG ATGGTTTGCA GACTATCAGA GAACATGTTT 2493 
276 CTCCTGTGAT CAGCTACCTA GCCTCTGCCA ACGTGTTCCA GCTTCCAGGA 2543 
278 GGCCACACAG ACCCCACCCC CCATGCTCTC ACCCTTTACC CCTGTGCTTT 2593 
280 TCACACACTA GGCAACTGCT CCACCACAGG ACCTCACACC TAGACCTCCG 2643 
282 TTTTTGACAC AGGGCCTTAA GGTAATCTGG CTGCCATCTG ACTATCTCTC 2693 
284 AGCACGTTCA CGTGTACAAT ATTTCATTCT TTTTCATTGC CAAGTTGTCT 2743 
286 TGTAAGGAGA GACCACAATG TGTCATCCAT GCCCAGCTTT TGTGTCTAAC 2793 
288 AAATAAAATC GCTGAAGGTG TTCAGGTGCA ATGGCCTGTG ACATTA 2839 


291 (2) INFORMATION FOR SEQ ID NO: 2: 

293 (i) SEQUENCE CHARACTERISTICS: 

294 (A) LENGTH: 618 

295 (B) TYPE: AMINO ACID 

296 (D) TOPOLOGY: LINEAR 

W--> 298 (ii) MOLECULE TYPE: 

299 (A) DESCRIPTION: PROTEIN 

301 (iii) HYPOTHETICAL: YES 

303 (vi) ORIGINAL SOURCE: 

304 (A) ORGANISM: RAT 

C > 305 (C) INDIVIDUAL ISOLATE: SODIUM/IODIDE SYMPORTER 

307 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

311 Met Glu Gly Ala Glu Ala Gly Ala Arg Ala Thr 

312 1 5 10 

314 Phe Gly Ala Trp Asp Tyr Gly Val Phe Ala Thr Met Leu Leu 

315 15 20 25 

317 Val Ser Thr Gly lie Gly Leu Trp Val Gly Leu Ala Arg Gly 

318 30 35 

320 Gly Gin Arg Ser Ala Asp Asp Phe Phe Thr Gly Gly Arg Gin 

321 40 45 50 

323 Leu Ala Ala Val Pro Val Gly Leu Ser Leu Ala Ala Ser Phe 

324 55 60 65 

326 Met Ser Ala Val Gin Val Leu Gly Val Pro Ala Glu Ala Ala 

327 70 75 80 

329 Arg Tyr Gly Leu Lys Phe Leu Trp Met Cys Ala Gly Gin Leu / 

330 85 90 95 

332 Leu Asn Ser Leu Leu Thr Ala Phe Leu Phe Leu Pro lie Phe 

333 100 105 

335 Tyr Arg Leu Gly Leu Thr Ser Thr Tyr Gin Tyr Leu Glu Leu 

336 110 115 120 

338 Arg Phe Ser Arg Ala Val Arg Leu Cys Gly Thr Leu Gin Tyr 

339 125 130 135 

341 Leu Val Ala Thr Met Leu Tyr Thr Gly lie Val He Tyr Ala 

342 140 145 150 

344 Pro Ala Leu lie Leu Asn Gin Val Thr Gly Leu Asp lie Trp / 

345 155 160 165 f 

347 Ala Ser Leu Leu Ser Thr Gly lie lie Cys Thr Leu Tyr Thr 

348 170 175 
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L:28 M:220 C: 

L:29 M:220 C: 
L:66 M:220 C: 
L:57 M:246 W: 
L:305 M:220 C: 
L:298 M:246 W: 


Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 

Keyword misspelled or invalid format, [(B) FILING DATE:] 

Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:] 

Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE:], SeqNo=l 
Keyword misspelled or invalid format, [(C) INDIVIDUAL ISOLATE:] 

Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE:], SeqNo=2 
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